Frequency Domain-Based Detection of Generated Audio

نویسندگان

چکیده

Attackers may manipulate audio with the intent of presenting falsified reports, changing an opinion a public figure, and winning influence power. The prevalence inauthentic multimedia continues to rise, so it is imperative develop set tools that determines legitimacy media. We present method analyzes signals determine whether they contain real human voices or fake (i.e., generated by neural acoustic waveform models). Instead analyzing directly, proposed approach converts into spectrogram images displaying frequency, intensity, temporal content evaluates them Convolutional Neural Network (CNN). Trained on both genuine voice synthesized signals, we show our achieves high accuracy this classification task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Newborn EEG Seizure Detection Based on Interspike Space Distribution in the Time-Frequency Domain

This paper presents a new time-frequency based EEG seizure detection method. This method uses the distribution of interspike intervals as a criterion for discriminating between seizure and nonseizure activities. To detect spikes in the EEG, the signal is mapped into the time-frequency domain. The high instantaneous energy of spikes is reflected as a localized energy in time-frequency domain. Hi...

متن کامل

Wide-Band Audio Coding Based on Frequency-Domain Linear Prediction

In this paper, we re-visit an original concept of speech coding in which the signal is separated into the carrier modulated by the signal envelope. A recently developed technique, called frequency domain linear prediction (FDLP), is applied for the efficient estimation of the envelope. The processing in the temporal domain allows for a straightforward emulation of the forward temporal masking. ...

متن کامل

Progress in LPC-based frequency-domain audio coding

This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http://creativecommons.org/ licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is unaltered and is properly cited. The written permission of Cambridge University Press must be obta...

متن کامل

Robust Audio Watermarks in Frequency Domain

In this paper an audio watermarking technique is presented, using log-spectrum, dirty paper codes and LDPC for watermark embedding. This technique may be used as a digital communication channel, transmitting data at about 40 b/s. It may be also applied for hiding a digital signature, e.g., for copyright protection purposes. Robustness of the watermarks against audio signal compression, resampli...

متن کامل

Drum Detection from Polyphonic Audio via Detailed Analysis of the Time Frequency Domain

This publication presents a method for the automatic detection and classification of three distinct drum instruments in real world musical signals. The regarded instruments are kick, snare and hi-hat as agreed by the participants of the contest category Audio Drum Detection within the 2nd Annual Music Information Retrieval Evaluation eXchange (MIREX 2005). There are two challenging issues inher...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IS&T International Symposium on Electronic Imaging Science and Technology

سال: 2021

ISSN: ['2470-1173']

DOI: https://doi.org/10.2352/issn.2470-1173.2021.4.mwsf-273